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4 <110> APPLICANT: DIVERSA CORPORATION 

5 Murphy, Dennis 

8 <120> ?ITLE of invention: ENZYMES HAVING ALPHA-GALACTOSIDASE ACTIVITY AND METHODS OP . 

USE THEREOF 

11 <130> FILE REFERENCE: DIVER1120-4 

13 <140> CURRENT APPLICATION NUMBER: 09/886,400 nm^w^W^W^W^ 

14 <141> CURRENT FILING DATE: 2001-06-20 Cm I I 

16 <150> PRIOR APPLICATION NUMBER: 09/619,032 B & ll 

17 <151> PRIOR FILING DATE: 2000-07-19 

19 <150> PRIOR APPLICATION NUMBER: 09/407,806 

20 <151> PRIOR FILING DATE: 1999-09-20 

22 <150> PRIOR APPLICATION NUMBER: 08/613,220 

23 <151> PRIOR FILING DATE: 1996-03-08 
25 <160> NUMBER OF SEQ ID NOS : 4 

27 <170> SOFTWARE: FastSEQ for Windows Version 4.0 

29 <210> SEQ ID NO: 1 

30 <211> LENGTH: 52 

31 <212> TYPE: DNA 

32 <213> ORGANISM: Artificial Sequence 

34 <220> FEATURE: 

35 <223> OTHER INFORMATION: polynucleotide probe 

37 <400> SEQUENCE: 1 52 

38 ccgagaattc attaaagagg agaaattaac tatgagagcg ctcgtctttc ac 

40 <210> SEQ ID NO: 2 

41 <211> LENGTH: 31 

42 <212> TYPE: DNA 

4 3 <213> ORGANISM: Artificial Sequence 

45 <220> FEATURE: 

46 <223> OTHER INFORMATION: polynucleotide probe 

48 <400> SEQUENCE: 2 33_ 
4 9 cggaagatct aggttcccca ttttcacccc t 

51 <210> SEQ ID NO: 3 

52 <211> LENGTH: 1095 

53 <212> TYPE: DNA 

54 <213> ORGANISM: Thernococcus alcaliphilus 

56 <22 0> FEATURE: 

57 <221> NAME/KEY: CDS . 

58 <222> LOCATION: (1)...(1092) 

60 <4 00> SEQUENCE: 3 ^ yi p 

6? ^tg aga gcg etc gtc ttt cac ggc aac etc cag tat gcc gaa ate cca 48 
Leu A?g Ala Leu Val Phe His Gly Asn Leu Gin Tyr Ala Glu lie Pro 



62 

63 1 5 



10 



96 



65 aag age gaa ate eca aag gte ata gag aag gea tae ate eca gte ate 

66 LyI ser Ilu He Pro Lys Val He Glu Lys Ala Tyr He Pro Val He 

67 20 25 30 

69 gag aea ctg att aaa gaa gaa att eet ttt ggg etc aac ata acg ggc 144 

70 Glu Thr Leu He Lys Glu Glu He Pro Phe Gly Leu Asn He Thr Gly 
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71 35 40 45 

73 tat acc tta aag ttc etc ccg aag gat att ata gac etc gtt aaa ggg 

74 Tyr Thr Leu Lys Phe Leu Pro Lys Asp He He Asp Leu Val Lys Gly 

75 50 55 60 

77 ggc ate geg agt gac ctg ata gag ata ate gga aeg age tac aeg cac 

78 GlY He Ala Ser Asp Leu He Glu He He Gly Thr Ser Tyr Thr His 

i. 75 80 

79 65 70 '=> 

81 gca ata etc cee etc etc ccg ctt age aga gta gaa gca caa gtt cag 

82 Ala He Leu Pro Leu Leu Pro Leu Ser Arg Val Glu Ala Gin Val Gin 

83 85 90 95 

85 aga gat agg gaa gtt aag gaa gag etc ttc gag ctt tet cea aag gga 

86 Arg Asp Arg Glu Val Lys Glu Glu Leu Phe Glu Leu Ser Pro Lys Gly 

87 100 105 110 

89 ttc tgg ctg cea gag etc gee tat gae ccg ata ate cct gee ata ctg 

90 Phe Trp Leu Pro Glu Leu Ala Tyr Asp Pro He He Pro Ala He Leu 

91 115 120 125 

93 aag gae aac ggt tat gag tat eta ttc gee gac ggg gag geg atg ctt 

94 Lys Asp Asn Gly Tyr Glu Tyr Leu Phe Ala Asp Gly Glu Ala Met Leu 

95 130 135 140 

97 ttc tea get eat etc aac teg geg ata aag cea att aaa ccg etc tat 

98 Phe ser Ala His Leu Asn Ser Ala He Lys Pro He Lys Pro Leu Tyr 

99 145 150 155 160 

101 cea cac ctt ata aag gcc caa agg gaa aag egc ttt agg tac ate age 

102 Pro His Leu He Lys Ala Gin Arg Glu Lys Arg Phe Arg Tyr He Ser 

103 165 170 175 

105 tat etc ctt ggt etc agg gag ctt agg aag gcg ata aag etc gtt ttt 

106 Tyr Leu Leu Gly Leu Arg Glu Leu Arg Lys Ala He Lys Leu Val Phe 

107 180 185 190 

109 gaa ggt aag gta aeg eta aag gca gte aaa gac ate gaa gcc gta ccc 

110 Glu Gly Lys val Thr Leu Lys Ala Val Lys Asp He Glu Ala Val Pro 

111 195 200 205 

113 gtt tgg gtg gee gtg aac aeg get gta atg etc ggc ate gga agg ctt 

114 val Trp val Ala Val Asn Thr Ala Val Met Leu Gly He Gly Arg Leu 

115 210 215 220 

117 cct ctt atg aat cct aag aaa gtg gcg age tgg ata gag gae aag gae 

118 Pro Leu Met Asn Pro Lys Lys Val Ala Ser Trp He Glu Asp Lys Asp 

119 225 230 235 240 

121 aac att ctt eta tac ggc acc gat ata gag ttc att ggc tat agg gae 

122 Asn He Leu Leu Tyr Gly Thr Asp He Glu Phe He Gly Tyr Arg Asp 

123 245 250 255 
125 att gca ggc tac aga atg agt gtt gag gga tta tta gag gtt ata gae 
12 6 He Ala Gly Tyr Arg Met Ser Val Glu Gly Leu Leu Glu Val He Asp 
127 260 265 270 

129 gag etc aac teg gaa ctg tge ctt cee tca gag ctg aag cac agt gga 

130 Glu Leu Asn Ser Glu Leu Cys Leu Pro Ser Glu Leu Lys His Ser Gly 

131 275 280 285 

133 agg gag etc tac tta egg act teg agt tgg gca cea gat aag age ttg 

134 Arg Glu Leu Tyr Leu Arg Thr Ser Ser Trp Ala Pro Asp Lys Ser Leu 

135 290 295 300 



192 
240 
288 
336 
384 
432 
480 
528 
576 
624 
672 
720 
768 
816 
864 
912 
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150 
151 



a„ .t. t,, Ill III III III Z 

138 Arg He Trp Arg Glu Asp ^.lu y 

irlH sr.?i s - 
Hi '^i % z ^ S s» - s r/, f/. 

^ ^ aat taa agg ggt gaa aat ggg gaa 
149 ata tat aac gat tgg agg gg y 

lie Tyr Asn Asp Trp Arg «^xy ^ 
355 

154 <210> SEQ ID NO: 4 

155 <211> LENGTH: 364 

159 <400> SEQUENCE: 4 
160 

^ c nn Tie Pro Lys Val He Glu Lys Ala 

162 Lys Ser Glu lie fro 

163 20 _ 2b ^^^^ 
164 
165 
166 
167 



ctt aat atg 
Leu Asn Met 

gag aac age 
Glu Asn Ser 

gat gcc ttc 
Asp Ala Phe 
350 

cct tag 
Pro 



ctg tec 
Leu Ser 
320 
gat gca 
Asp Ala 
335 

egg gcg 
Arg Ala 



<400> SEQUhiWun.: h . ^ t^,, rin 

Leu Arg Ala Leu Val Phe His Gly Asn Leu Gin 



Glu Thr Leu He Lys Glu Glu lie Pro Phe Gly 
Tyr Thr Leu Lys Phe Leu Pro Lys Asp He He 
III Gly fie Ala Ser Asp Leu lie Glu He He Gly 
\% fla He Leu Pro Leu leu Pro Leu Ser Arg Val 
i72 Arg Asp Arg Glu Val Lys Glu Glu Leu Phe Glu 



Tyr Ala Glu 

Tyr He Pro 
30 

Leu Asn He 
45 

Asp Leu Val 
60 

Thr Ser Tyr 
Glu Ala Gin 



1^4 Ph. Trp Leu pro =lu l^u Ala Tyr Asp Pro He 



"7I Lys ASP AS^ Oly Tyr <=lu Tyr Leu Phe Al. Asp 
f,8 Phe III Ala His Leu Asn ser Ala He Lys Pro 
III "ro His Leu Ue Lys Al. alh Ar, alu Lys Ar, 
is2 Tyr Leu L.U Gly i" Ar, Glu Leu Arg Lys Ala 
i" Glu Gly Lys Z Thr Leu Lys Al. val Lys Asp 



120 



185 
186 
187 
188 

189 225 



val Trp ial Ala Val Asn Thr Ala Val Met Leu 



pro Leu Met Asn Pro Lys Lys Val Ala Ser Trp 

230 

III lie Leu Leu Tyr Gly Thr Asp He Glu Phe 



Leu Ser Pro 
110 

He Pro Ala 

125 
Gly Glu Ala 
140 

He Lys pro 

Phe Arg Tyr 

He Lys Leu 
190 

He Glu Ala 

205 
Gly He Gly 
220 

He Glu Asp 



He Pro 
15 

Val He 

Thr Gly 

Lys Gly 

Thr His 
80 

Val Gin 
95 

Lys Gly 



He Leu 
Met Leu 



He Gly Tyr 



Leu Tyr 
160 
He Ser 
175 

Val Phe 

Val Pro 

Arg Leu 

Lys Asp 
240 
Arg Asp 
255 



960 



1008 



1056 



1095 



191 
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132 lie .la Oly Tyr .r. Met Ser Val Olu Cl. Leu Leu Olu Val He Asp 
III olu Leu Asn ^S^^ Olu Leu C.s Leu P.o Se. Clu Leu Lys His Ser Cly 

195 275 ^ p ser Leu 

196 Arg Glu Leu Tyr Leu Arg Thr Ser Ser Trp Al 

197 290 ^ Leu Asn Met Leu Ser 

198 Arg He Trp Arg Glu Asp Glu Gly Asn Ala Arg i.e 

199 305 31° T Ala Glu Asn Ser Asp Ala 

200 Tyr Asn Met Arg Gly Glu Leu Ala Leu Leu Ala c 

201 „ zvT-tT Ara Leu Asp Ala Phe Arg Ala 

202 Arg Gly Trp Glu Pro Leu Pro Glu Arg Arg Leu A p 



204 lie Tyr Asn Tsp Trp Arg Gly Glu Asn Gly Glu Pro 

205 355 360 



340 
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